Spatial Semantics and Classifier Cascades

نویسندگان

  • Lexing Xie
  • Xuming He
چکیده

In this paper, we describe the system developed by the Australian National University (ANU), National ICT Australia (NICTA) for multimedia event detection applied to the TRECVID-2011 video retrieval benchmark. Our system uses five audio and visual features, leverages training events with cascaded classifier training, and sees performance improvements with spatial semantic features. A summary of our submitted runs can be found below: Run number Short hand Description pRun01 01_combined_all Combination from all features, with classifier bagging, and cascade training from SIFT. cRun02 02_combined_nocas Same as Run01 except without cascade training. cRun03 03_combined_visual Run01 without the two audio features. cRun04 04_combined_single Combination from all features, without classifier bagging, with SIFT. The best run from our system ranks fourth in mean-ActualNDC, and third in meanF1 metric, averaged over all ten events among sixty runs from nineteen teams.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Structural patterns of information cascades and their implications for dynamics and semantics

Information cascades are ubiquitous in both physical society and online social media, taking on large variations in structures, dynamics and semantics. Although the dynamics and semantics of information cascades have been studied, the structural patterns and their correlations with dynamics and semantics are largely unknown. Here we explore a large-scale dataset including 432 million informatio...

متن کامل

Combined scattering for rotation invariant texture analysis

This paper introduces a combined scattering representation for texture classification, which is invariant to rotations and stable to deformations. A combined scattering is computed with two nested cascades of wavelet transforms and complex modulus, along spatial and rotation variables. Results are compared with state-of-the-art algorithms, with a nearest neighbor classifier.

متن کامل

Quantifying Structural Patterns of Information Cascades

Information cascades are ubiquitous in both physical society and online social media, taking on large variations in structures, dynamics and semantics. Although there has been much progress on understanding the dynamics and semantics of information cascades, little is known about their structural patterns. In this paper, we explore a large-scale dataset including 432 million information cascade...

متن کامل

Spatial Narration in Amir Naderi's New York Trilogy

This article is concerned with the relationship of language and city in Amir Naderi’s trilogy of films on New York, comprising of Manhattan by Numbers (1993), A, B, C… Manhattan (1997), and Marathon (2002). By dint of a narrative relied on spatiality, he is in fact able to causally link the solitude and the spectral existence of his protagonists to the lack of a common language for reco...

متن کامل

Fast classification using sparse decision DAGs

In this paper we propose an algorithm that builds sparse decision DAGs (directed acyclic graphs) from a list of base classifiers provided by an external learning method such as AdaBoost. The basic idea is to cast the DAG design task as a Markov decision process. Each instance can decide to use or to skip each base classifier, based on the current state of the classifier being built. The result ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011